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Abstract 

We study a chain of four interacting rotors (rotators) connected at both ends to stochastic heat 
baths at different temperatures. We show that for non-degenerate interaction potentials the system 
relaxes, at a stretched exponential rate, to a non-equilibrium steady state (NESS). Rotors with high 
energy tend to decouple from their neighbors due to fast oscillation of the forces. Because of this, 
the energy of the central two rotors, which interact with the heat baths only through the external 
rotors, can take a very long time to dissipate. By appropriately averaging the oscillatory forces, 
we estimate the dissipation rate and construct a Lyapunov function. Compared to the chain of 
length three (considered previously by C. Poquet and the current authors), the new difficulty with 
four rotors is the appearance of resonances when both central rotors are fast. We deal with these 
resonances using the rapid thermalization of the two external rotors. 
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1 Introduction 


We consider a chain of 4 classical rotors interacting at both ends with stochastic heat baths at different 
temperatures. Under the action of such heat baths, many Hamiltonian systems are known to relax to 
an invariant probability measure called non-equilibrium steady state (NESS). In general, the explicit 
expression for this invariant measure is unknown, and the convergence rate depends on the nature of 
the system. For the model under consideration, we obtain a stretched exponential rate. 

For several examples of Hamiltonian chains, properties of the NESS (e.g., thermal conductivity, 
validity of the Fourier law, temperature profile,...) have been studied numerically, perturbatively, or via 
some effective theories. See for example [2,3,7,13,15,19,21] for chains of rotors and [1^, 16,20,21] 
for chains of oscillators. From a rigorous point of view however, the mere existence of an invariant 
measure is not evident, and has been proved only in special cases. 

A lot of attention has been devoted to chains of classical oscillators with (nonlinear) nearest 
neighbor interactions. In such models, each oscillator has a position g* G M (we take one dimension 
for simplicity), is attached to the reference frame with a pinning potential U{qi), and interacts with its 
neighbors via some interaction potentials W {qi+i — qi) and W {qi — qi-i). 

It turns out that the properties of the chain depend crucially on the relative growth of W and U at 
high energy. In the case of (asymptotically) polynomial potentials, and for Markovian heat baths, it has 
been shown [5,9-12,27] that if W grows faster than U, the system typically relaxes exponentially fast 
to a NESS. The convergence is fast because, thanks to the strong interactions, the sites in the bulk of 
the chain “feel” the heat baths effectively even though they are separated from them by other sites. 

In the strongly pinned case, i.e., when U grows faster than W, the situation is more complicated. 
When a given site has a lot of energy, the corresponding oscillator essentially feels only its pinning 
potential U{qi) and not the interaction. Assume U{q) oc q^^ with k > 1. An isolated oscillator 
pinned with a potential U and with an energy E oscillates with a frequency that grows like £'i/2-i/2fc 
This scaling plays a central role, since the larger the energy at a site, the faster the corresponding qi 
oscillates. But then, the interaction forces with the sites i + 1 and i — 1 oscillate very rapidly and 
become ineffective at high energy. Therefore, a site (or a set of sites) with high energy tends to decouple 
from the rest of the chain, so that energy can be “trapped” in the bulk. This mechanism not only makes 
the convergence to the invariant measure slower, but it also makes the proof of its existence harder. The 
case where W is quadratic is considered in [18]. There, Hairer and Mattingly show that if U{q) oc q^^ 
with k sufficiently large, no exponential convergence to an invariant measure (if there is one) can take 
place. Moreover, they show that an invariant measure exists in the case of 3 oscillators when k > 3/2. 
The existence of a NESS for longer chains of oscillators remains an open problem when the pinning 
dominates the interactions. 

Chains of rotors are in fact closely related to strongly pinned oscillator chains: The frequency of a 
rotor scales as where E is its energy. This scaling corresponds to that of an oscillator in the limit 
/c —)■ oo, for the pinning U{q) oc q^^ discussed above. In this sense, our chain of rotors behaves as a 
chain of oscillators in the limit of “infinite pinning”, which is some kind of worst-case scenario from the 
point of view of the asymptotic decoupling at high energy. On the other hand, the compactness of the 
position-space (it is a torus) in the rotor case is technically very convenient. The problems appearing 
with chains of strongly pinned oscillators are very similar to those faced with chains of rotors, and so 
are the ideas involved to solve them. 

The existence of an invariant measure for the chain of 3 rotors has been proved in [6], as well as 
a stretched exponential upper bound of the kind exp(—c\/f) on the convergence rate. The methods, 
which involve averaging the rapid oscillations of the central rotor, are inspired by those of [18] for the 
chain of 3 oscillators. 

In the present paper, we generalize the result of [6] to the case of 4 rotors, and obtain again a bound 
exp(—c\/f) on the convergence rate. The main new difficulty in this generalization is the presence 
of resonances among the two central rotors. When they both have a large energy, there are two fast 
variables and some resonant terms make the averaging technique developed in [6] insufficient. A 
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large portion of the present paper is devoted to eliminating such resonant terms hy using the rapid 
thermalization of the external rotors. 

It would he of course desirable to he able to work with a larger number of rotors. The present 
paper uses explicit methods to deal with the averaging phenomena. We hope that by crystallizing the 
essentials of our methods, longer chains can be handled in the same spirit. We expect that for longer 
chains, the convergence rate is of the form exp(—ct^), for some exponent k G ( 0 , 1 ) which depends on 
the length of the chain. We formulate a conjecture and explain the main difficulties for longer chains in 
Remark 5.3. 

We now introduce the model and state the main results. In §2, we study the behavior of the system 
when one of the two central rotors is fast, and construct a Lyapunov function in this region. In §3, we 
do the same in the regime where both central rotors are fast. In §4 we construct a Lyapunov function 
that is valid across all regimes, and in §5 we provide the technicalities necessary to obtain the main 
result. 

1.1 The model 



Figure 1 - A chain of four rotors with two heat baths at temperatures Ti and Ti. 

We Study a model of 4 rotors, each given by a momentum p* G M and an angle t/j G T = M/27rZ, 
i = 1,..., 4. We write in the sequel q= (gi,.. •, 94 ) G T^, p = (pi,.. • ,P 4 ) G M^, and x = {q,p) G 
n, where If = T*^ x is the phase space of the system. We consider the Hamiltonian 

4 2 

= E y + ^L(g 2 - qi) + Wciq^i - (? 2 ) + - Qa) , ( 1 - 1 ) 

i=l 

where Wi ;T —)-M, I = L,C,R (standing for left, center and right) are smooth 27r-periodic interaction 
potentials (see Figure 1). 

Convention: Unless specified otherwise, the arguments of the potentials are always as above, namely 
IUl = 1Tl( 92 — fZi). Wc = Wc{q‘i — ( 72 ) and Hr = IUR(g 3 — ( 74 ). The same applies to any function 
with index L, C and R. Note that the argument for R is (73 — q^ (and not (74 — ( 73 ) since this choice will 
lead to more symmetrical expressions between the sites 1 and 4. 

To model the interaction with two heat baths, we add at each end of the chain a Langevin thermostat 
at temperature Ti, > 0, with dissipation constant 7 ^ > 0, 6 = 1,4. Introducing the derivative of the 
potentials wi = W{, I = L, C, R, the main object of our study is the SDE: 

dqi=pidt, i = l,...,4, 

dpi = (rPL - 7iPi) dt + ^/2-fiTidBl , 

dp 2 = (wc - wl) dt , ( 1 . 2 ) 

dp3 = -{wc + wr) dt , 

dp4 = {wR - 74 P 4 ) dt + v^274T4dRi^ , 
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where B}, Bf are independent standard Brownian motions. The generator of the semigroup associated 
to ( 1 . 2 ) reads 


4 

L = ^^Pidq^ + wi, ■ (9pi ~ *^^ 2 ) ■ {dp2 ~ dp^) + u)R • (9p4 — dp^) 

i=l 

+ {-IbPbdp, + ibTbd'^J . 
b=l,4 


(1.3) 


Remark 1,1, In contrast to [ 6 ], we do not allow for the presence of pinning potentials U(qi) and of 
constant forces at the ends of the chain, although we believe that the main result still holds with such 
modifications. While constant forces would be easy to handle, the addition of a pinning potential would 
require some generalization of a technical result (Proposition 3.12) which we are currently unable to 
provide (see Remark 3.13). 

We consider the measure space (fl, B), with the Borel fi-field B over fl. The coefficients in (1.2) 
are globally Lipschitz, and therefore the solutions are almost surely defined for all times and all initial 
conditions. We denote the transition probability of the corresponding Markov process by P*{x, •), for 
all X G and t >0. 


1,2 Main results 

We will often refer to the sites 1 and 4 as the outer (or external) rotors, and the sites 2 and 3 as the 
central rotors. We require the interactions from the inner rotors to the outer rotors to be non-degenerate 
in the following sense: 

Assumption 1,2, We assume that for I = L, R and for each s G T, at least one of the derivatives 
(s), k > I is non-zero. 

This assumption is not very restrictive. In particular, it holds if all the potentials consist of finitely 
many nonconstant Fourier modes. 

Our main result is a statement about the speed of convergence to a unique stationary state of the 
system (1.2). In order to state it, we introduce for each continuous function / : 0 —)■ (0, 00 ) the norm 
II • II j on the space of signed Borel measures on 11: 

gdv . 


Mf = sup / 


If / = 1, we retrieve the total variation norm. 

Theorem 1,3 (NESS and rate of convergence). Under Assumption 1.2, the Markov process defined 
by ( 1 . 2 ) satisfies: 

(i) The transition probabilities P^{x, dy) have a C°°((0, 00 ) x 11 x H) density pt{x, y). 

(ii) There is a unique invariant measure tt, and it has a smooth density. 

(Hi) ForallO < 9i < min(l/Ti, l/r 4 ) and all 62 > Oi, there exist constants C, X > d such that for 
all X = (qi,q 2 ,... ,^ 4 ) G H and all t > 0, 


\\P\x, •) - ttW^b^h < ^ 

At thermal equilibrium, namely when Ti = T 4 = 1//3 for some /? > 0, the invariant measure is the 
Gibbs distribution with density jZ, where Z is a normalization constant. 

Theorem 1.3 will be proved in §5 with help of results of [ 8 ] and the existence of a Lyapunov 
function, the properties of which are stated in 
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Theorem 1.4 (Lyapunov function). Let 0 < 6 * < min(l/Ti, I/T 4 ). Under Assumption 1.2, there is a 
function L ; —)■ [l,oo) satisfying: 


(i) There are constants ci > 0 and a £ (0,1) such that 

(ii) There are a compact set K and constants 02,03 > 0 such that 

Lv < C2IX - T{y) > 

with p '.[ 1 , 00 ) —)■ ( 0 , 00 ) the increasing, concave function defined by 


p{s) = 


03 S 


2 + log(s) ■ 

Most of the paper will be devoted to proving the existenee of sueh a Lyapunov funetion. 


(1.5) 


( 1 . 6 ) 


(1.7) 


Remark 1.5. We assume throughout that Ti and T 4 are strictly positive. While the conclusions of 
Theorem 1.4 remain true for Ti = T 4 = 0 (with any 9 > 0), part of the argument has to be changed 
in this case, as sketched in Remark 3.17. The positivity of the temperatures is, however, essential 
for Theorem 1.3; at zero temperature, the system is not irreducible, and none of the conclusions of 
Theorem 1.3 hold. 


1.3 Overview of the dynamics 

To gain some insight into the strategy of the proof, we illustrate some essential features of the dynamics 
(1.2). Since the exterior rotors (at sites 1 and 4) are directly damped by the —yi,Pb df terms in (1.2), 
we expect their energy to decrease rapidly with large probability. More specifically, for b = 1,4, we 
find that Lpi, is equal to —ybPb plus some bounded terms, and thus we expect pb to decay exponentially 
(in expectation value) when it is large. Therefore, the external rotors recover very fast from thermal 
fluctuations, and will not be hard to deal with. 

On the other hand, the central rotors are not damped directly, and feel the dissipative terms of (1.2) 
only indirectly, by interacting with the outer rotors. The interesting issue appears when the energy of 
the system is very large and mostly concentrated in one or both of the central rotors. If most of the 
energy is at site 2 (meaning that |p 2 | is much larger than all other momenta), the corresponding rotor 
spins very rapidly, i.e., q 2 moves very rapidly on T. But then, the interaction forces wi^{q 2 — qi) and 
^uc (<?3 ~ 72 ) oscillate rapidly, which causes the site to essentially decouple from its neighbors. The 
same happens when most of the energy is at site 3, when wc{q 3 — 92 ) and wvi{q 3 — q^) oscillate rapidly. 
And when both |p 2 | and \p 3 \ are large and much larger than |pi| and |p 4 |, the forces wi^{q 2 — qi) and 
^ur(73 ~ Qi) are highly oscillatory, so that the central two rotors almost decouple from the outer ones 
(the force wq might or might not oscillate depending on p 2 and pa). 

This asymptotic decoupling is the interesting feature of the model: in principle, if the central 
rotors do not recover sufficiently fast from thermal fluctuations, the energy of the chain could grow 
(in expectation value) without bounds. On the other hand, when their energy is large, the decoupling 
phenomenon should make the central rotors less affected by the fluctuations of the heat baths. Our 
results imply that both effects combine in a way that prevents overheating. See [6, Remark 3.10] 
for a quantitative discussion of these two effects for a chain of three rotors. See also [17] for a clear 
exposition of the overheating problem in a related model. 

Figure 2 illustrates the evolution^ of the momenta at two different time scales, starting with 
p(0) = (50, 20, 30,40). The upper graph shows that indeed pi and p 4 decrease very fast, and the lower 

* The numerical algorithm used in this paper is based on the one described in [19]. The time step is either 10“^ or 10“^ 
depending on the situation. 
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Figure 2 - Evolution of the momenta pi,... ,P 4 , for 71 = 74 = 1, T\ = 1, T 4 = 10, 
q(0) = (0,0, 0, 0) andp(O) = (50, 20,30,40). The interaction potentials in the simulations 
here are Wi — — cos so that the forces are wi = sin, I = L, C, R. 


graph indicates that p 2 and ps remain large for a significantly longer time, but eventually also decrease. 
Since for this initial condition ps is larger than p 2 , the force wji oscillates faster than rcL- Therefore, 
P 3 couples less effectively to the outer rotors (where the dissipation happens) than p 2 , and hence p^ 
decreases more slowly. 

If one were to look at these trajectories for much longer times, one would eventually observe some 
fluctuations of arbitrary magnitude, followed by new recovery phases. But large fluctuations are very 
rare. 

Since the system is rapidly driven to small pi, pi, it is really the dynamics of {p 2 ,Pz) that plays the 
most important role. We will often argue in terms of the 8-dimensional dynamics projected onto the 
P 2 P 3 -plane. We illustrate some trajectories in this plane for several initial conditions in Figure 3. To 
make the illustration readable, we used a very small temperature, so that the picture is dominated by 
the deterministic dynamics. 

The typical trajectory is as follows. Starting with some large |p 2 | and jpaj, the slower of the two 
central rotors is damped faster than the other, so that the projection drifts rapidly towards one of the 
axes. This leads to a regime where only one of the central rotors is fast, while the other is essentially 
thermalized. The energy in this fast rotor is gradually dissipated, so that the orbit follows the axis 
towards the origin. 

The behavior that we observe in Figure 3 around the diagonal p 2 = ps far enough from the origin is 
easily explained: in the “center of mass frame” of the two central rotors, we simply see two interacting 
rotors that oscillate slowly in opposition, while being almost decoupled from the outer rotors. More 
precisely, introducing Q = 93 — 92 £ T and P = p 3 — p 2 £ we see that {Q,P) acts approximately 
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Figure 3 - The evolution of p 2 and p 3 for several initial conditions. The potentials are 
Wi = — cos, I = L, C, R. Furthermore, 71 = 74 = 1 , Ti = 0.1, and T 4 = 0.4. Each “x” 
sign indicates the beginning of a trajectory. 


as a mathematical pendulum with potential 2Wc, plus some rapidly oscillating (and therefore weak) 
interactions with the outer rotors: 

Q = P, P = —2wc{Q) + weak interactions. 

Typically, if at first the energy in the center of mass frame is not large enough to make a “full turn,” 
Q oscillates slowly around a minimum of Wq, which corresponds to a hack-and-forth exchange of 
momentum between 2 and 3, and explains the strips that we observe around the diagonal. The two 
central rotors are then gradually slowed down, until at some point the interaction with the external 
rotors tears them apart. 

The picture in the absence of noise (that is, when Ti = T 4 = 0, which is not covered by our 
assumptions) is quite different, due to some resonances. We discuss their nature in Appendix A. These 
resonances are washed away by the noise, and are therefore not visible here. They nevertheless play an 
important role in our computations, as we will see. 

1.4 Strategy 

In order to obtain rigorous results about the dynamics and construct a Lyapunov function, we will 
apply specific methods to each regime described above. We present them here in increasing order of 
difficulty. 

• When a significant part of the energy is contained in the outer rotors, then as discussed above, 

the momenta of the two outer rotors essentially decrease exponentially fast. In this region, 
the Lyapunov function will be , and we will show that when pf + is large enough and 
9 < min (1/Ti, I/T 4 ), then (Lemma 4.1). 

• When most of the energy is contained at just one of the central sites, namely at site j = 2 or 

j = 3, we will show that Lpj ~ ~Pj^ when averaged appropriately (Proposition 2.2). This 
corresponds to the neighborhood of the axes in Figure 3. This case is essentially treated as 
in [6]. In this region, we use a Lyapunov function Vj ~ (with a G (0,1)) such that 

LVj < —Vj/p'j (Proposition 2.4). 
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• When both \p 2 \ and \p 3 \ are large and hold most of the energy, we do not approximate the 

dynamics of p 2 and ps separately, but we consider instead the “central” Hamiltonian He = 

2 2 

^ ^ + VFl + Wc + Hr. We show that when averaged properly, LHc ~ —— P 3 ^ 

(Proposition 3.2). The Lyapunov function in this region is Vc ~ and we show (Proposi¬ 
tion 3.5) that LI 4 < —Vc/Hc. Showing that LHc ~ ~P 2 ^ most difficult part of 

our proof. The averaging of the rapidly oscillating forces will prove to be insufficient due to 
some resonances, which manifest themselves for some rational values of p^/p 2 - We will consider 
separately the vicinity of the p 2 = ps diagonal, which is easy to deal with (Lemma 3.7), and 
the case where |p 3 — P 2 I is large, which requires substantially more work (§3.3). In the latter 
case, we will use the rapid thermalization of the external rotors in order to eliminate the resonant 
terms. 

The factors l/p^ and 1 /He in LVj < —Vj/p^j and LVe < —Vc/Hc are the cause of the logarithmic 
contribution in (1.7), which leads to the subexponential convergence rate. 

The final step (§4) is to combine V 2 , V 3 and I 4 (which each behave nicely in a given regime) 
to obtain a Lyapunov function V that behaves nicely everywhere and satisfies the conclusions of 
Theorem 1.4. 


1.5 The domains 

Following the discussion above, we decompose Ll into several sub-regions. This decomposition only 
involves the momenta, and not the positions. All the sets in the decomposition are defined in the 
complement of a ball Hr of (large) radius R in p-space: 

4 

Hr = X |p e ; ^pf < . 

i=l 

For convenience, we consider only R > \/2 (see Remark 1.6). We also use (large) integers k, i, 
and m which will be fixed in §4, and we assume throughout that 

1 < k < i < m . ( 1 . 8 ) 


The first regions we consider are along the p 2 and p 3 axes: 

Q 2 = Ll 2 {k, R) = G H ; P2 > (pf + P3 + \ Br , 

Q 3 = Ll3{k,R) = |x G H : p| > (pf +pi -f P4)''| \ Br . 


(1.9) 


The region H 2 (resp. H 3 ) corresponds to the configurations where most of the energy is concentrated at 
site 2 (resp. 3). The next region corresponds to the configurations where most of the energy is shared 
among the sites 2 and 3: 


Lie = Hc(f,m,H) = |x G H ; P2 -f pi > {pf+pl)'^, pf > pj, pf > pij \ Br (1.10) 


(the conditions pi^ > pi and pi^ > pi ensure that both |p 2 | and |p 3 | diverge sufficiently fast when 
IIpII —)■ 00 in HJ. These regions are illustrated in Figure 4 and Figure 5. Note that H 2 , LI 3 , Lie do 
intersect and do not cover Ll. However, for R large enough, the set H 2 U H 3 U He U Br contains the 
P2P3-plane (more precisely, the product of and some neighborhood of the P2P3-plane in momentum 
space), which is where the determining part of the dynamics lies, as discussed above. 

Remark 1.6. As a consequence of the restriction R > \/2, we have Llj{k', R') C Llj{k,R) for all 
k' > k, R! > R, and j = 2,3. Therefore, if a bound holds for all x G Llj{k, R), it also holds for 



Vpi+pI 



Figure 4 - A projection of the domains The spherical surface represents 

Pi ~ some C > R. 


P3 



Figure 5 - The intersection of the sets fl 2 ,fl 3 , flc with the p 2 P 3 -plane (the lower half-plane 
is obtained by axial symmetry). 


all X G ^j{k', R'). Similarly, at fixed £, we have Oc(^, m', R') C m, R) for all m' > m and 
R! > R. This allows us to increase k, m and R as needed (hut not i). We also observe immediately 
that for all fc, f, m, and for j = 2,3, 

lim inf |p, | = oo , (1-11) 

H—too x£flj{k,R) 

lim inf \pj\ = cx) . (1-12) 

R^oo xGQc{^,'fn,R) 


1.6 Notations 

Since averaging functions that rapidly oscillate in time will play an important role, we introduce the 
(?j-average (/) • = ^ fdqi of a function / : fl —)• M over one period of qi. The result is a function 
of p and {gj : j / i}. In the presence of a generic function / : T —)• M of one variable, we write simply 
~ ^ fo^ /('S)ds, which is a constant. 

For any function / : T —M satisfying (/) = 0, one can find a unique integral F : T —)> M such 
that F' = f and (F) = 0. More generally, we write /l-^l for the integral of / that averages to zero. 
Without loss of generality, we fix the additive constants of the potentials so that 

(VFi)=0, I = L,C,R. (1.13) 


9 












We also introduce two “effective dissipation constants”: 

02 = 71 {wl) > 0 , 03 = 74 (VIr) > 0 , (1.14) 

where the positivity follows from Assumption 1.2. Note also that because of (1.13), there is no 
indeterminate additive constant in the aj. 

Finally, throughout the proofs, c denotes a generic positive constant that can he each time different. 
These constants are allowed to depend on the parameters and functions at hand, hut not on the position 
X. We sometimes also use d to emphasize that the constant has changed. 

2 When only one of the central rotors is fast 

We consider the regime where either \p 2 \ or |p 3 | (hut not both) is much larger than all other momenta. 
The estimates for this regime are simple adaptations from [ 6 ], but we recall here the main ideas. 

We start with some formal computations, thinking in terms of powers of p 2 (resp. ^ 3 ) only. Then, 
we will restrict ourselves to the set Q. 2 {k, R) (resp. R)) for some large enough k and R, so that 
the other momenta are indeed “negligible” (see Lemma 2.3) compared to p 2 (resp. ps). 

2.1 Averaging with one fast variable 

Assume that \p 2 \ is much larger than the other momenta. We think in terms of the following fast-slow 
decomposition: the variables qi,q 3 , q^ and p evolve slowly, while q 2 evolves rapidly, since q 2 = P 2 , 
and p2 is large. In this regime, the variable q2 swipes through T many times before any other variable 
changes significantly. The dynamics for short times is 

p{t) ^ p{0) , 

f = l,3,4, (2.1) 

g' 2 (f) ~ 92 ( 0 )+P 2 ( 0 )f (mod 27r) . 

We consider an observable / : 12 —M and let g be defined by 

Lf = g. (2.2) 

Under fhe approximafion (2.1), fhe quantify g{x{t)) oscillafes very rapidly around ifs q 2 -average ( 5 ) 2 , 
which is a function of the slow variables qi, 53 , q^ and p. We therefore expect the effective equation 
Lf ss ( 5^)2 to describe the evolution of / over several periods of oscillations, and we now show how to 
give a precise meaning to this approximation. 

Although the stochastic terms (which appear as the second-order part of the differential operator L) 
appear in the computations, they do not play an important conceptual role in this discussion; the rapid 
oscillations that we average are of dynamical nature and are present regardless of the stochastic forcing 
exerted by the heat baths. 

The generator of the dynamics (2.1) is simply 


L 2 = P2dg2 . 

Decomposing the generator L defined in (1.3) as L = L2 + {L — L2) and considering powers of 
P 2 , we view L 2 as large, and L — L 2 as small. Note thaf for all smooth /i : U —)■ M, we have 
(^ 2 / 1)2 = P 2 (< 992^)2 = 0 by periodicity, so that the image of L 2 contains only functions with zero 
( 72 -average. Consider next the indefinite integral G = f(g — (5)2)452 (we choose the integration 
constant C(qi, 53 , q4,p) to our convenience). By construction, we have L2{G/p2) = g — ( 5 ) 2 ? so that 

L(f--) ={g)^ + {L 2 -L)-. (2.3) 

\ P 2 J P2 
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By subtracting the “small” counterterm G/p 2 from /, we have managed to replace g with its g 2 -average 
in the right-hand side, plus some “small” correction. This procedure is what we refer to as averaging 
with respect to q 2 , and it makes sense only in the regime where |p 2 | is very large. If ( 5)2 = 0 and 
(L 2 — L){G/p 2 ) is still oscillatory, the procedure must be repeated. 


2.2 Application to the central momenta 


We now apply this averaging method to the observable p 2 , in the regime where |p 2 | is very large. By 
the definition of L, we find 

Lp2 = me - rcL . (2.4) 


We have (^ 0)2 = (^^^L )2 = 0- Moreover, dg^Wciqs - 92 ) = -wciqs - ^ 2 ) and dg^WL{q 2 - qi) = 
wi,{q 2 — qi). Thus, in the notation above, G = J{wq — wi,)dq 2 = —Wq — VTl, and we introduce the 
new variable 


(1) G , ITc + VFl 

P 2 =P2 - =P2-\ - 

P2 P2 


(2.5) 


By (2.3), we obtain 




Pzwc - pircL Wcw\, - Wi,wc + VT'lw'l - Wewe 


P2 


pI 


(2.6) 


Observe that the right-hand side of (2.6) is still oscillatory, but now with an amplitude of order l/p 2 . 
which is much smaller than the amplitude of (2.4) when |p 2 | is large. Furthermore, the right-hand side 
of ( 2 . 6 ) has zero mean, since (^ 0)2 = {'^ 1)2 ~ ^ and 

{Wcwi^ - Wi^wc + - Wcwc)2 = ^ (dg,(Wc + IFl )")2 = 0 , 


by periodicity. In order to see a net effect, we need to average again. We consider now the observable 
/ = P 2 ^\ and apply the same procedure. Instead of averaging the right-hand side of (2.6) in one step, 
we first deal only with the terms of order — 1 in p 2 > by introducing 


(2,^ (D + pan+PaH-c 

P 2 

We postpone further computations to the proof of Proposition 2.2 below, and explain here the main 
steps. We will see that Lp 2 consists of terms of order —2 and —3 (by construction, the contribution 

of order —1 disappears). The terms of order —2 have mean zero, and will be removed by introducing 

f3) 

a new variable P 2 ■ We will then find that Lp 2 contains terms of order —3 and —4. To replace the 
terms of order —3 with their average (which is finally non-zero), we will introduce a function p^^'^. This 
will complete the averaging procedure. 

We illustrate in Figure 6 the time-dependence of P 2 ,p '2 ^ and p^^'^ (slightly shifted for better 

9 (i) 

readability) . Clearly, the oscillations of P 2 are much smaller than those of p 2 , and we barely perceive 

f2) 

the oscillations of P 2 \ since they are smaller than the random fluctuations. 

Before we state the result of this averaging process, we introduce a convenient notation for the 
remainders. 

^The irregularity of the envelope of p 2 in Figure 6 is due to the randomness of the phases of the two oscillatory forces wl 

(2) 

and wc'- they sometimes add up, and sometimes compensate each other. Note also that the trajectory of pj is rougher than 

j'2'\ 

the other two, since the definition of ' involves pi, which is directly affected by the stochastic force. 
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/ 


Figure 6 - The effect of the coordinate changes (2.5) and (2.7) on the effective oscillations. 
Note that two of the curves are shifted vertically for easier readability. 


Definition 2.1. Let /, g be two functions defined on the set {x : p 2 0}. We say that f is 02 (g) 
if there is a polynomial z such that when \p 2 \ is large enough, 

\f{x)\ < z(pi,p 3 ,p 4 )\g{x)\ . (2.8) 


The analogous notation O 3 will be used when {p^l is large, and with a polynomial z{pi,p 2 ,P 4 )- 


This notation reflects the fact that when most of the energy is at site 2 (resp. 3), one can forget 
about the dependence on pi,P 3 ,P 4 (resp. pi,P 2 -,pf), provided that it is at most polynomial (hy the 
compactness of T^, the position q is irrelevant). For example, the term {piWi, + psVFc) Ip\ in (2-V) is 
02 {pf\ 

It is easy to realize that the Oj, j = 2,3, follow the same basic rules as the usual O. In particular, 

Oj{gi) + Oj{g 2 ) = Oj{\gi\ + \g 2 \) and Oj{gi)Oj{g 2 ) = Ojigm). 

Proposition 2.2. There are functions p 2 and ps of the form 


P2=P2 + 
P3=P3 + 


P2 

P3 


+ 

+ 


PiWi^ + PsWc 


Pi 


PaWb. + p2Wc 
Pi 


+ C> 2 {P 2 j 
+ £’ 3(^3 , 


such that for j = 2, 3, 

Lpj = -ajpj^ + , 

where 02 > 0, 03 > 0 are defined in (1.14). Furthermore, 




dp^P2 = — +02{P2 ) , dp^P2 = 02{P2 ) 
P 2 


dpiP3 = OsiPs 


- 4 \ 


Wb , 

dpj3 = ^ + ^ 3(^3 ) • 
P3 


(2.9) 

( 2 . 10 ) 

( 2 . 11 ) 


( 2 . 12 ) 


Proof. It suffices to consider the case j = 2. The variable p 2 is constructed as m [ 6 ]. We continue the 

( 2 ) 

averaging procedure started above. It is easy to check that Lp 2 can be written as 



P2 


2pi (VFlwl - Wi,wc) + 2p3 (ITcruL 

pI 


Wcwc) 


with 

Ri = -pIWi^ - pIWc - + WlWc + + wIj%r . 
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f3) 

Since it is a total derivative, the term of order —2 has zero (72-average, and hy introducing P2 = 
P? - we find 


Lpf = 


dq2R2 + + Pi (Idc^L - 2 Wi,wc) + P 3 { 2 Wcwi, - Wlwc) 

pI 


+ 02 {P: 


-4n 


( 2 . 13 ) 


with 


R2 = -p\Wi^ - pIWc - ' + ^PiWl 

+ 27 irilf^W ^ + SpsW^ + SpsWi^^WK • 


One can then average the terms of order -3 in ( 2 . 13 ). We have again {dq^R2)2 = 0 hy periodicity, and 
after integration hy parts we find 


{Wcwi:)2 = {wcWi)2 and ^ = -71 {wl)^ = -02 

(for fhe signs, recall thaf VFl = 1 ^l (92 — Qi) and Wq = ffc(<?3 — <?2))- By adding appropriate 
counterterms (nol wriffen explicifly), we ohfain a funclion p^^'^ = p® + 02(p^^) such fhaf 



a2 {p^Wi^WQ-PiWcwi,)^ 

o “T o 

P2 P2 


+ 02{P2^). 


The first term in the right-hand side is the one we are looking for, and we deal with the other term of 
order —3 (which is non-zero) as follows. We observe that 


{p^Wl^Wc-piWcWi:)2 = {pidq, +p^dq,) {WlWc )2 = L {Wl^Wc)2 , 

since (1 TlB^c) 2 a function of qi, only. We then set 

( 4 ) (i^Li^c )2 

P2-P2 - 3 - 

P 2 

and obtain ( 2 . 11 ). It is immediate by the construction of p2 that ( 2 . 12 ) holds. □ 

We now introduce a lemma, which says that remainders of the kind Oj{\pj\~^), j = 2 , 3 , can be 
made very small on Qj{k, R), provided that the parameters k, R are large enough. 

Lemma 2.3. Let j £ {2,3} and r > 0 . Fixe > Q and a function f = Oj{\pj\~'^). Then, for all 
sufficiently large k and R, we have 


sup |/(x)|<e. 

{k,R) 

Proof We prove the result for J = 2 . By Definition 2 .1 and (1.11), there is a polynomial z such that 
for all large enough R and all k, we have |/| < 5;(pi,P3,P4)|p2|~^ on Q2{k, R). But then, we have on 
the same set 

^f^^z{pi,p3,pi) c + c{pl+pl+pl)^ c + c\p2\^^ I |2^_^ 

--Sr-■ 

where the second inequality is immediate for sufficiently large N, the third inequality comes from the 
definition of LI2, and the fourth inequality holds because |p2| is bounded away from zero on 122(/c, R). 
Recalling (1.11), we obtain the desired result when k is large enough so that 

We now construct partial Lyapunov functions in the regions D2 and D3. 
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Proposition 2.4. Let 0 < 0 < min(l/Ti, I/T 4 ) and a G (0,1). Consider the functions^ 


V2 = (1 + F2{q2 - qi)/pl) , 14 ) 

4^3 = elP3l“+fp|(l +Fate-<?4)/pi) , 

with the pj of Proposition 2.2, and F 2 , F 3 : T —)■ M such that respectively -^ 2 ( 5 ) = — 

Wl^{s)) andF^{s) = 9‘^'yfri{{W^ — W^{s)). Then, there are constants Ci,C 2 , C'i > 0, independent 
of a G (0,1), such that for all sufficiently large k and R, we have for j = 2,3 the following inequalities 
on Llj.- 

F3elPil“+|p| < Vj < , (2.15) 

LVj < . (2.16) 

Proof By symmetry, it suffices to prove the result for j = 2. In this proof, we do not allow the O 2 to 
depend on a G (0,1) (that is, we want the bound (2.8) to hold uniformly in a G (0,1)). We start by 
proving (2.15). For large enough R, we have that |p 2 | >2 on LI 2 . Moreover, since P 2 = P 2 + 02(^2 
and F 2 {q 2 — qi)/P 2 = ^^ 2 (^ 2 ^)’ Lemma 2.3 that for large enough k, R, it holds on LI 2 that 


IP2I > 1 and 


F 2 iq 2 - qi) 


pI 



( 2 . 17 ) 


Moreover, since both |p2| and |p2| are > 1 , ( 2 . 9 ) implies, for all a G(0,1), 

\\p2r - \P2n < \P2 - pI\ = |2(^Ll + TLc) + 02{pf^) 


Since PFl and FFc are bounded, it follows from Lemma 2.3 that we can bound the right-hand side by a 
constant, so that we find 

celP2l“+fpi < elP2l“+fp| < c'elP2l“+fpi , ( 2 . 18 ) 


uniformly in a. By fhis, by fhe definilion of V2, and by ( 2 . 17 ), we obfain ( 2 . 15 ). We now prove ( 2 . 16 ). 
Lef f{s) = and note fhaf 


F(elP2l“+2Pl) = Lf{p2) = f{p2)Lp2 + f'{p2) ^ lbTb{dp^P2f ■ (2.i9) 

6=1,4 

By Proposition 2.2, we have on ^2 thaf 

f{p2)Lp2 = el^2l“+fp| (Qp^ {-a2P2^ + 02{pf*)) 

\ P2 J (2.20) 

= (^_(^.^Qp-‘2 02{pf^)) , 

where we have used lhalp 2 = P 2 + 02{pf^), and fhaf a|p 2 |“~^ < 1 (since |p 2 | > 1 ), so fhaf on LI 2 , 
fhe 02(^2 obfained is indeed uniform in a. Next, one can verify that uniformly in a G (0, 1 ) and 
|P 2 | > 1 , 


f”iP 2 ) < f{P 2 ) { 9 ‘^P 2 + 201^2! + c) . 

Moreover, by (2.12) we have J2b=i,4 lhTb{dp^P 2 )‘^ = 7iTi4Ll/p2 + C> 2 {pf^), so that on ^ 2 , 

f"{P2) 'ybTb{dp^P2f < (6(2p| + 26'|p2| + c) r7iri^ + £>2(^2 

6=1,4 ^ ^2 

= elP2|“ + fp| + 02(P2 • 

3 The role of the contribution is to facilitate the patchwork that will lead to a global Lyapunov function in §4. The 
corrections involving F 2 and F 3 help average some Wl and that appear in the computations. Without this correction, we 
would need a condition on 6 that is more restrictive than the natural condition 6 < minfl/Ti, 1 /T 4 ). 
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Therefore, by (2.19), (2.20) and (2.21), 


L(elP2r+fp|) < J_elP2r+fp| + e'^-fiTiWl + 02{P2^)) 

P 2 


But then 


LV2={1 + 


F2{q2-qi) \ ^ L\p,r+lpi\ ^ ^Ip^r+lpi^ ( F2{q2 - qi) 
vl ) \ / \ pI 


< 1 + 


P2 

F2{q2-qi)\ 1 ^|p2|“+|p2 


pI 


_elP2|“+5P2 (_a20 + + 02{P2^)) 

P 2 




Pi 


= _elP2r+2P2 (_a20 + {Wl) + 02(P2 ^)) 

P 2 


Using the definition of a 2 in (1.14) and the condition on 0, we find that —a20 + 9‘^'jiTi (IUl) is 
negative. Using then Lemma 2.3 to make the 02(^2 small, and combining the result with (2.18) 

completes the proof. □ 


3 When both central rotors are fast 

We now study the regime where both \p2\ and jpaj are large (not necessarily of the same order of 
magnitude), and \pi \ and \p4\ are much smaller. We then have two fast variables: q2 and 53. As we will 
see, this will lead to some trouble related to resonances, and averaging the rapid oscillations will not be 
enough. We start with some formal computations thinking in terms of powers of p2 and ps, and then 
restrict ourselves to the set Uc(f', m, R) for some appropriate parameters. 

3.1 Averaging with two fast variables: resonances 

Now the fast-slow decomposition is as follows: qi, q^ and p are the slow variables, and q2, qs are the 
fast variables, with the approximate dynamics (for short times) 

p{t) ^ p{0) , 

qi{t)^qi{0), i = l,4, 

92(f) ~ 92(0) +P2(0)f (mod 27 r) , 

93(f) ~ 93(0) +P3(0)f (mod 27 r) , 

generated by L2 + L3 = P2dq^ + p^dq^, which we see as the most important contribution in L. Let 
again /, p : U —)■ M and assume that 

Lf = g . 

We would like, as above, to add a correction to / in the left-hand side in order to replace g with its 
average in the right-hand side. However, since the fast motion of ((72, 93) on (in the dynamics ( 3 . 1 )) 
follows orbits that are open or closed depending on whether p2 and p3 are commensurable or not, there 
seems to be no natural notion of “average of g” that is continuous with respect to the slow variables. 

Consider for example g{x) = sin( 2(72 — 93)- In our approximation, sm{ 2 q 2 {t) — 93(f)) oscil¬ 
lates with frequency (2p2 — Pz)I‘^t^- Thn average is zero when p3 / 2p2, and sin(2(72(f) — 93(f)) 
remains constant when p3 = 2 p 2 - When ps is close to 2 p 2 , the oscillations are slow, and one cannot 
simply average sin( 2 ( 72 (f) — 93(f))- More generally, any smooth function p on U can be written as 
mez ^n,m sin(ng2 + mqi, + Pn,m) for some coefficients an,m and ipn,m which depend on the slow 
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variables gi, 94 and p. Each such term gives rise to problems close to the line P 3 /P 2 = —njm in the 
P2P3-plane. 

However, if g depends on q 2 but not on q^, then no problem appears. In the approximation (3.1), 
the quantity g{x{t)) then oscillates rapidly around (( 7 ) 2 , which is then a function of the slow variables 
qi, q 4 and p. Then, as in §2.1, we use G = f(g — (g} 2 )dq 2 (we choose the integration constant 
independent of gs), so that (L 2 + L 3 ){G/p 2 ) = L 2 {G/p 2 ) = g - ( 5 ) 2 - Thus, L{f - G/P 2 ) = 
{ 9)2 + {L — L 2 — L 3 ){f — GIP 2 ), which has the desired form. Similarly, if g depends on 53 but not on 
q 2 , we use the counterterm G/pz with G = f (g — (g}^)dq 3 . And of course, if g can be decomposed as 
the sum of a function not involving 53 and a function not involving q 2 , then we can average each part 
separately and sum the two counterterms. 

It turns out that we will mostly encounter terms that depend only on one of the fast variables, and 
are therefore easy to average. We will go as far as possible averaging such terms, and then introduce a 
method to deal with the resonant terms (involving both q 2 and qs) that appear. 

3.2 Application to the central energy 

As a starting point, we use the central energy 

Hc = f + f + WL + Wc + WR. 

Definition 3.1. Let = {x € fl : p 2 ^ 0,p3 ^ 0} and let f,g be two junctions dejined on a set 
A C A*. We say that f is Oc{g) (on the set A) if there is a polynomial z such that for all x ^ A with 
min(|p 2 |) IpsI) large enough, we have 


\f(x)\ < z{pi,pi)\g{x)\ . 

Unless explicitly stated otherwise, we take A = A*. 

We state the main result of this section. 


Proposition 3.2. There is a function of the form 


u u , , P^^^ I m t\ 1-2 , I i-2x 

Hc = Hc-\ -^- \-Oc{\p2\ + P3 ) 

P2 P3 


such that 


P 2 P 3 

with aj as defined in (1.14). Furthermore, 


LHc = ~-^+ 0,{\P2\-^/^ + \P3\-^/^) , 


dp,H, = ^ + Ocipf^) , = ^ + Ocipf). 

P2 P3 


(3.2) 

(3.3) 


(3.4) 


In order to reduce the length of some symmetric formulae, we use the notation “+ as a 
shorthand for the other half of the terms with the indices exchanged as follows: 1 4, 2 3, L R, 

and the sign of wq changed (due to the asymmetry of the argument (73 — 52 of ITc)- 

In order to prepare the proof of Proposition 3.2, we proceed as follows. We first see that 


L(Hc) = -piWL -P4WK . 


Since wl does not involve qs and wr does not involve q 2 , it easy to find appropriafe counterterms: we 
introduce 


hW 


U , PiITl P41Tr 

“r “r 

P2 P3 


(3.5) 
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and obtain 


= 


-7iPiWl - pjwL + wiyVi, piWi,{wi, - wc) 


P2 


+ 


pI 


47 


The terms of order l/p 2 do not depend on and have mean zero with respect to q 2 (again wiyVi^ = 
^ 92 has zero g 2 -average by periodicity). Similarly, the terms in l/ps do not involve q 2 and 
average to zero with respect to 53 . Therefore, we introduce a next round of counterterms: 


nm = i,(i) + ( imwl'+p\w^-wll2 




pI 


+ 47 


and obtain 


-p\w-L - Syipfl^L - TiPi^L' + 4 pi1Tl'w^l + 27iTi1Tl ^ 

-p 1 - 


(3.6) 


-3^ 


Pi 


Pi 


pi 


pi 


The terms in the first line are easy to eliminate, since each one depends on only one of the fast variables 
and averages to zero. The terms 7ir(;LffL^Vpi ^^d 74ruRlT|*^Vpi the ones we are looking for, 
since after integrating by parts, we find = — 7 i(kkL )2 ~ and ( 74 ^i'Rhl^R ^)3 = 

— 74 (fk ^)3 = —as- The two “resonant” terms involving Wi^wq and W-^wq are more problematic and 
we leave them untouched for now. By introducing the appropriate counterterms (which we do not write 
explicitly), we obtain a function + Oc{\p 2 \~^ + IPal”^) such that 


lhP 


02 03 piWlwc , P^Wrwc 


pI 


Pi 


pi 


+ 


Pi 


+ Oc{\p2\ +|P 3 | 


|-3^ 


(3.7) 


In order to obtain (3.3), we must get rid of the two “mixed” terms involving Wi,wc and W-^wc, 
which are of the same order as the dissipative contributions involving 02 and 0:3. Since they each 
depend on both q 2 and ^ 3 , these terms are not easy to get rid of, due to the resonance phenomenon 
discussed above. In fact, as discussed in Appendix A, these resonances have a physical meaning. Their 
effect becomes clearly visible when Ti = r 4 = 0 (which is not covered by our assumptions): they alter 
the dynamics in the p 2 P 3 -plane, but do not prevent He from decreasing in average. We postpone to 
§3.3 the construction of the counterterms that will eliminate these resonant terms. 

We introduce next two technical lemmata and an application of Proposition 3.2. The following 
lemma is analogous to Lemma 2.3. 


Lemma 3.3. Let j £ {2,3} and r > 0. Fix an integer £> 0, and an s > 0. Let f be some Oc{\pj\ 
on the set = {x £ LI : p 2 ^ Q,P 3 ^ 0}. Then, for all sufficiently large m and R, we have 


sup |/(x)|<e. 


Proof We prove the result for / = Oc{\p 2 \ and proceed as in Lemma 2.3. By Definition 3.1 and 
(1.12), there is a polynomial 2; such that for all m and all sufficiently large R, we have on Llc{i, m, R), 


\f\< 

< 


z{piHa) ^ c + c{pI +pI)^ ^ 
\P2Y ~ \P2Y 

/ 9 9/^7 — I I 

c+ (P2 +P 2 )"' ^ C + c|p2| ^ 
\P2Y - \P2V 


c + c{pI+pI)'^ 
\P2Y 

^ I I—-r 
< C|P2| , 


where we choose N large enough and use the definition of Lie- By (1.12), we conclude that the desired 
result holds for m large enough so that — r < — ^ ■ □ 
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Lemma 3.4. Let f = Oc{p 2 ^p^) for some zi,Z 2 G M such that zi, Z 2 have the same sign. Then, 
f = Oc{\p 2 f^+^^ + \p?,V^+^^). 

Proof. We apply Young’s inequality in the form xy < x“ + if with a = >1,6= > 1, 

and X = |p 2 |^\ y = \pzf^- We obtain ^ This, and the definition of 

Oc, eomplete the proof. □ 


As a eonsequenee of Proposition 3.2 we have: 
Proposition 3.5. Let 0 < 0 < min(l/Ti, I/T 4 ) and define 


Vc = Hcd 


eHc 


1 , - 62 (^ 2 - 91 ) , Fsiqs-qf) 

-L \ Q “r 


P 2 


P 3 


with the He of Proposition 3.2 and F 2 , F 3 as in Proposition 2.4. Let i > 1 be a fixed integer. Then, 
there are constants 6 * 4 , C^, Cq > 0 such that for all large enough m and R, the following inequalities 
hold on Llc{m, £, R): 


Ci{pl +pl)e2^PHpfi <Vc< C^{pI +p|)e5(P2+P3) , 
LVe < . 


(3.8) 

(3.9) 


Proof. We first prove (3.8). By (3.2), the boundedness of the potentials, and Lemma 3.3, we have for 
m, R large enough that on Lie, 


2 2 

2 2 


< c 


and 


- 62(^2 -qi) ^ ^ 3(93 - ^ 4 ) 


P 2 


P 3 


1 

<2- 


(3.10) 


In addition, if m, R are large enough, + p| is large on Lie, so that the first part of (3.10) implies that 
c{p 2 + P 3 ) < He < c'(p 2 +^ 3 )- This and (3.10) imply (3.8). 

We next prove (3.9). Define f{s) = By Proposition 3.2, 


L{Hee^^^) = Lf{He) = f{He)LHe + f{He) Y, JbTb{dp,Hef 

6=1,4 


P2 P3 


(3.11) 


w/ 


K 


+ ( h^He + 20) ( 7iTi^ + 74T4^ + Oe{\p2\-^ + |P3r") 

P2 P3 


Now observe that for any C G M, we have 

ft + c = + OAI) = (1 + a(p7 + vD ). 


sinee trivially {p\ + p|) ^ < P 2 ^ + P 3 then, by (3.11) and Lemma 3.4, we find that 

L{Hee^Y ^ 


JHcPI +P 3 (0‘^'yiTiWl - 002 + Oei\p2\ 


pI 


+ 77 


As in the proof of Proposition 2.4, the correetions involving F 2 and F 3 replace the oscillatory terms 
and with their averages: 


LVe = Li HeC 




F 2 , T 3 


pI pI 


1 H — 5 - H — 5 - 1 + HeC^^'^L ( ^ H — 5 - 


F 2 , T 3 


pI pI 
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< e 


9H, 


pI+pI ({wl) - ea2 + Oci\p2\-^/^) 


pI 


+ 4^ 


Therefore, by the definition ( 1 . 14 ) of aj and the eondition on 9 , we have 


LVr < e 


OH^pI+pI (-C+Oc{\P2\ , -C + Oci\p3\ 


P2 


+ 


P3 


Finally, by Lemma 3 . 3 , and using that (p| + p‘^){p2 ^ + P3 > 2 , we indeed obtain ( 3 . 9 ). □ 

We now return to the proof of Proposition 3 . 2 . We need to find some counterterms to eliminate the 
mixed terms in ( 3 . 7 ). For this, we use a subdivision of A* = {x G ; p2 / 0,p3 / 0 } into 3 disjoint 
pieces, as shown in Figure 7 : 

Ai = {xe : |P2 +P 3 I > (P2 -P3f} , 

A2 = {x G A^ : {P2 -P 3 f > \P 2 +P 3 \ > {P2 -p3)^/2} , ( 3 . 12 ) 

As = {x G A : {p2 -P 3 f > 2|p2 +P3I} • 

By construction, is close to the diagonal p2 = P3, A^ is far from it, and yl2 is some transition 
region. 


P 3 



Figure 7 - Projection of the partition ,4* = U ^2 U onto the p 2 P 3 -plane. Note that 
the sets Ai, A 2 and A 3 do not include the p 2 and ps axes. 


Lemma 3 . 6 . The following holds: 

(i) On Ai U yl2, the quantity \p2 — P3\ is both Oc{\/\p2\) and Oc{\/\p3\)- 

(ii) On A2 U yl3, the quantity \p2 — P3\~^ is both Oc{\p2\~^^‘^) and OcdPsI”^^^)- 

Proof. Trivially, (i) holds because on Ai U A2, we have the scaling |p3 — P2I ^ \/\P2 + P3I ~ ~ 

y^. To obtain (ii), observe that either p2 and p^ have the same sign and by the definition of A2 U A^, 
\P2 - P3\ > sJ\P2 +P3\ = sJ\P2\ + |P3| > max(ydpd> \/M)> or they have a different sign and 

|P2 - P3I = IP2I + |P3| > max(|p2|, |P3|) ^ max(ydp2l, s/\P3\)- In both cases, we have the desired 

bound. □ 

We first work on vli U yl2. In this region, p2 and p3 are close to each other, and are both large in 
absolute value. It is then easy to find a counterterm for pi VFl wq,Ip\ and p^Wj{Wc /P3. Indeed, FFl and 
IFr oscillate very rapidly (the respective frequencies are approximately p 2 / 27 r and p 3 / 27 r), while wq 
oscillates only “moderately”, with frequency {p3 — P2) / 27 r. One can then simply average the rapidly 
oscillating part, and obtain 
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Lemma 3.7. Let R 12 = PiW'^^wc/p^ — PiW'^wc/p^. Then, 


LR 12 = 


_ PJ}^ ^ u , 


pI 


pI 


Proof. We have for the first term: 

piW^'^wc _ 


PiWi^wc ^ piW'l'^w'^. • {P3-P2) ^ 


L 


P2 


pI 


pI 




pI 

where the last equality uses Lemma 3.6 (i). A similar eomputation for the seeond term eompletes 
the proof. □ 

The counterterm R 12 works well on Ai U A 2 because |p 3 — P 2 I is small compared to p 2 and p^. 
We now have to find a counterterm R 23 that works on A 2 U A 3 and then patch the two counterterms 
together on A 2 . We state the properties of the counterterm R 23 in the following lemma, but postpone 
its construction to §3.3. 

Lemma 3.8. There is a function R 23 = Oc{\p 2 \~‘^ + IpsI”^) defined on A 2 U A 3 such that 


LR23 = 


_ PJ}^ ^ u .43) 


pI 


pI 


and 


dpiR23 = Oc{p 2 ) and dp^R23 = Oc{p^ 


(3.13) 


Assuming that Lemma 3.8 is proved, we next join the two counterterms R12 and R23 by a smooth 
interpolation on A 2 in order to prove Proposition 3.2. 

Proof of Proposition 3.2. We introduce a smooth function p : M U {— 00 , 00 } —)• [0,1] such that 
p(a;) = 1 when |x| < 1 and q{x) = 0 when |a;| > 2. We then consider the function 

. (3.14) 

P 2 +P 3 ) 

which is well-defined and smooth on the set A* = Ai U A 2 U A 3 = {x G fl : p 2 / 0 ,p 3 / 0}. 
Moreover, it is equal to 1 on Ai, and 0 on A 3 . We now omit the arguments and simply write q instead 
of (3.14). Using Lemma 3.7 and Lemma 3.8, we obtain 

L {qRi 2 + (1 — Q)R 23 ) = QLRi 2 + (1 — q)LR 23 + (i ?12 “ R 23 )Lq 

= + \PPi-^i'‘) + - R.^)Lp. 


pI 


Observe next that 

Lq = Q ■ 


Pi 


AP?.-P2) , „ ^ , {P3-P2f , . , 

2 -(ruL - tt’R - 2wc) + - -^+ wr) 


P2 +P 3 


\p 2 + P 3 \^ ~ \p 2 \^^‘^ ~ we see that Lq is 


Since g' has support in A 2 , where |p 3 — P 2 I 
simultaneously Oc{\p 2 + P 3 \~^^‘^), Oc{\p 2 \~^^‘^) and Oc(\p 3 \~^^‘^). But then, by (3.15) and using that 
Ri 2 - R 23 = Oci\p 2 \~'^ + IpsT^), we find 


L(pR,, + (1 - rffe) = 

P 2 P 3 


(3.17) 


We set now 


H, = hP + gR32 + (1 - g)R23 


From (3.17) and (3.7), we deduce immediately that (3.3) holds. Moreover, (3.4) follows from (3.13), 

f3) 

the expressions for He and iii 2 , and the fact that g does not depend on pi and p^. This completes the 
proof of Proposition 3.2. □ 


20 



3.3 Fully decoupled dynamics approximation 

We construct here the counterterm R 23 of Lemma 3.8, which eliminates the two resonant terms 
—piWi,wc/P2 and paW-rwc/Ps on A2 U A3 when both \p2\ and \p3\ are large. In this regime, all 
three interaction forces wi^,wc,w^, oscillate rapidly (since \p 2 \, \p 3 — P 2 I and |p 3 | are all large) and 
we expect the dynamics to he well approximated hy the following decoupled dynamics, where all the 
interaction forces are removed. 


Definition 3.9. We call decoupled dynamics the SDE 

dqi=pidt, i = l,...,4, 

dpfe = -ybPb dt + 2ybTbdB^ , 6 = 1,4, (3.18) 

dpj = 0dt, j = 2,3, 

with generator 

4 

L = '^Pi^q^ + ^ {-IbPbdp, + IbTbd^J , (3.19) 

i=l 6=1,4 

and denote by Ea, the corresponding expectation value with initial condition x G fl. 

We will construct two functions Ui, U 4 such that LUi = piWi^wc and LU 4 = —P 41 Lriuc- Then, 
we will introduce a change of variable x 1 —)■ x{x) such that x approximately obeys the decoupled 
dynamics, so that L([/i(x)) Ri pillLtuc and L(?74(x)) ss —PaWjiwc in the regime of interest. Finally, 
we will show that the choice R 23 {x) = Ui{x)/p\ + U 4 {x)/p^ satisfies the conclusions of Lemma 3.8. 

The decoupled dynamics can be integrated explicitly for any initial condition x = {qi,... ,^ 4 ) G fl. 
For the outer rotors b = 1,4, we have 

Pb{t) = e-^»^ph + V^b f dS,^ , 

_ - 7 .* ° ft fs (3.20) 

qb{t) = qb-\ -- Pb + V 276^6 / ( / dS^/'jds , 

76 Jo ^ Jo ^ 

and for the central ones (j = 2, 3) we simply have 


PiW =Pi > 

qj{t) = qj +Pjt (mod 27r) , 


(3.21) 


which is deterministic. We decompose the variables between the central and external rotors as 


x = {xe,Xc) •withxe = {qi,pi,q4.,Pi) a.ndxc = {q2,P2,q3,P3) ■ 

Under the decoupled dynamics, the two processes Xe{t) and Xc{t) are independent and Xc{t) is 
deterministic. Moreover, under the decoupled dynamics, Xe{t) has the generator 

Le= {Pbdq, - JbPbdp, + IbTbdpJ , 

6=1,4 

and admits the invariant probability measure vfe on (T x M)^ given by 

d7fe(xe) = —dgi dpi dq^ dp4 , 

Zj 

where Z is a normalization constant (recall that Ti, T 4 > 0 by assumption). 
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Definition 3.10. We denote by S the set of functions f £ M)/or which the norm 


\f{x)\ 


xeSl ^ ~T~ Pi ~r Pi 

is finite. We denote by Sq the subspace of functions f £ S for which 


(3.22) 


/ f{Xe,Xc) d-KeiXe) = 0 far all Xc G (T X M)^ . 

2(TxM)2 

We will later eonsider / = piWi^wc and / = —piWuivc, whieh are manifestly in 5o. 
Lemma 3.11. There are constants (7*, c* >0 such that for all f £ Sq, all x £ Tl, and all f > 0, 


|E./(x(t))| < C,e-^-*|||/||| (1 +pI +pI) . (3.23) 

Proof As mentioned, Xe{t) and Xc{f) are independent under the decoupled dynamics. Introducing the 
expectation value E® with respect to the process Xe{f) under the decoupled dynamics, we obtain that 
for any function / on 12, 

%f{x{f)) = ¥f.J{xe{f),Xc{f)) , (3.24) 

where Xc{t) is (deterministically) given by (3.21). 

The process Xe(f) under the decoupled dynamics is exponentially ergodic, with the unique invariant 
measure vfe defined above. Indeed, one can check explicitly that this measure is invariant, and 
introducing the Lyapunov function Ve{xe) = 1 + pf + p\, we easily obtain that LeVe < c — cVe- It 
follows from [23, Theorem 6.1]^ that there are two constants C*, c* >0 such that for any function 
5( : (T X M)^ — )■ M such that 51/14 is bounded. 


sup 

Xe 


- Tte{g)\ 
I+pI+pI 


<(7*6 “^‘^sup 

Xe 


\g{Xe) - 7re(g)| 
I+p\+p\ 


(3.25) 


Let now / G 5o. For any fixed n G (T x M)^, we apply (3.25) to the function gv{xe) = /(xe, v). Since 
/ G 5o, we have Tte{gv) = 0. Therefore, for any f > 0, 


sup 

Xe 


\^'xJ{Xe{t),v)\ 


< (7*6 




< C* 6 -"** 


(3.26) 


1 + Pi + P4 * Xe Pi +pI 

This holds for all v, and in particular for v = Xc{t). Therefore, by (3.24), we have the desired result. □ 
The next proposition constructs a right inverse of L on 5o [24-26]. We use here the notation 


x = (xi,...,X8) = {qi,---,qi,Pi,---,P4) ■ 


(3.27) 


Proposition 3.12. Let f £ Sq be a function such that for all multi-indices a, we have d-f £ Sq, and 
let 


{Kf){x) = - E^f{x{t)) dt . 

Jo 


(3.28) 


Then: 

(i) Kf and its derivatives of all orders are in S. 

(ii) We have 

LKf = f. 


"'One should also check that there is a skeleton with respect to which every compact set is petite. This is obvious, but can 
be proved with methods similar to those of §5 in [6]. 
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Proof. By Lemma 3.11, the integral (3.28) converges absolutely for all x and we have Kf G S. We 
now prove the result about the derivatives. By (3.20) and (3.21), we can write 

dxAf) , , , 

^ = /i.,(t), (3.29) 

where the hij are deterministic functions of t only that grow at most linearly (namely 0, 1, 6“'*''’*, 
(1 — e “^*’*)/76 and t). We then have 

8 - ® - 

— {%f{x{t))) =^hij{t)E4{dif){x{t))] . 

^ i=l 

For the derivatives of order n, we find by induction 

n 

(E^/(x(f))) = ^ [Ylhi^j^{t)^Ea:[{di,^...,i„f){xit))] , (3.30) 

. ,iti 

where the sum is taken over all (ii,..., i„) G {1, 2,..., 8}*^. Since by assumption G 5o, we 

have by Lemma 3.11 that 

\^x[{di^,...,i„f){x{t))]\ < {I+pI+pD ■ 

But then, by (3.30), we have 

roo 

\di^,...,i„Kf{x)\ = / {E^f{x{t))) dt 

Jo 

<c{1+p‘1+pI) [ (n/i4ifcW)e"''**df 

fc=l 

Since the hij grow at most linearly, the time-integrals in the right-hand side converge. Therefore, Kf 
is C°° and (i) holds. 

For the second statement, we observe that 

roo roo j 

LKf = -j LExf{x{t))dt = - j —^xf{x{f))dt = %f{x{d)) = f{x), 

where we have used that lim^^oo '^xf{x{t)) = 0 by (3.26). □ 

Remark 3.13. The proof of Proposition 3.12, and in particular (3.29), relies on the linear nature of 
the decoupled dynamics. If we add constant forces ri and at the ends of the chain (as in [6]), the 
method above applies with little modification, and with the replacements Pb ^ Pb — Xb/^b^ b = 1,4, 
in the invariant measure vfe. Flowever, if we add pinning potentials of the kind U{qi), the decoupled 
dynamics cannot be solved explicitly, and we do not have (3.29) for some deterministic functions 
hij{t). Although we believe there exists an analog of Proposition 3.12 in that case, we are currently 
unable to provide it. The situation is even worse in the simultaneous presence of constant forces and 
pinning potentials. In that case, the expression of vfe is not known [14], which makes it difficult to 
decide whether a given function is in 5o. (Of course, although there is no difficulty there, the averaging 
of p 2 , P 3 and He also needs to be adapted to accommodate for such modifications of the model.) 

We now have an inverse of L on a given class of functions. We next use it to find an approximate 
inverse of L. The key is to introduce a change of variables x = (gi, pi,..., qi^pf) such that for nice 
enough functions /, it holds that L{f{x)) ^ {Lf){x) in the regime of interest. Here and in the sequel. 
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it is always understood that x is viewed as a function of x. We compare the actions of L and L in 
Lemma 3.14. We state this lemma with the notation (3.27), and write generically 

L = '^{bi{x)di +Gidf) and L = '^(hi{x)di + (Jidf) . (3.31) 

i i 

In our case, only as and a%, which correspond to the variables pi and p 4 , are non-zero. 

Lemma 3.14. Consider a change of coordinates x i—)• x{x) = x + s{x), defined on some set Oq C fl. 
Assume that for all j, 

L{xj) = bj{x) -f ej{x) 

for some £j. Then, for any smooth function h, we have for all x G flo that 

L{h{x)) = {Lh){x) + C(x) , 


where 

Cix) = ^idjh)ix)ej{x) + 2 ^ (^) “1“ ^ ^ (^) • (3.32) 

j i,k 

Proof. We do the eomputation for the ease of just one variable x E M. Let g{x) = x{x) = x + 5(x). 
From the definition of L and L, and sinee by assumption Lg = ho g + find 

L{h o g) = {h' o g) ■ Lg + a ■ {h" o g) ■ g’^ 

= {li og)-(fog^e)^o ■ {h” o g) ■ g'"^ 

= (Lh) og + {h' og) -e + a ■ {h” o g) ■ {g'^ - 1) 

= {Lh) o g + {h' o g) ■ e + a • {h" o g) • (2s' -f s'^) . 


The desired result follows from generalizing to the multivariate case. 

We consider now the following change of variables defined on A 2 U A^: 

Wl {q2 - qi) 


□ 


qi = qi, 

72 = 72 , 


Pi = Pi 


P2=P2 + 


= Pl +Oc{p2^) , 


P2 


_ M-'c (l:< - 12) OM-'A 

P2 P3 - P2 


(3.33) 


with analogous expressions for the indices 3, 4. Here, we have used Lemma 3.6 (ii) to replace 
Wl/{P 3 — P 2 ) with Oc{\p 2 \~^''‘^)- Straightforward computations show that, on yl 2 U A 3 , 


L{pi) = -yipi + Oc{pf^) = -yipi + Ocipf^) , 
L{qi) =pi=pi + Oc{pf^) , 

L{p2) = 00 ( 1 ^ 21 “^ + {P2-P3)~^) = Oc{pf^) , 
L{q2) =P2=P2 + Oc{\p2\~^^^) , 


(3.34) 


with similar expressions for the indices 3,4 (we have again used Lemma 3.6 (ii)). 

While one could choose a more refined change of variables by going fo higher orders, fhe change 
(3.33) is good enough for our purpose. 


Lemma 3.15. Let f G S. Then f is Oc(l)- Moreover, given any function ^ H —)■ [0,1], we have that 
f{x + C{x){x — x)) = Oc{l) on A 2 U A 3 . In particular, f{x) = Oc{l) on A 2 U ^ 3 . 
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Proof. By assumption, |/(x)| < |||/|||(1 + Pi + p\) on ri, so that immediately / = Oc(l). Moreover, 
f{x + C{x){x — x)) is well-defined on A 2 U A 3 , and 

\f{x + C{x){x-x)) \ < lll/lll ^1-h -h (^p 4 -({x)^'^ 

whieh is indeed a Oc(l) on this set. The claim about f{x) follows from the choice ^ = 1. □ 

Proposition 3.16. Let f satisfy the assumptions of Proposition 3.12, and consider the change of 
coordinates (3.33). Let h = Kf. Then, on the set A 2 U ^3 (meaning that we take x G ^2 U A 3 , and 
not necessarily x G A 2 U A 3 ), we have h(x) = Oc(l) and 

L{h(x)) = fix) + Oci\p 2 \~^^‘^ + \P3\~^^^) ■ 

Proof. We use again the notations x = (xi,...,xs) = (qi,... ,q 4 ,pi,... ,^ 4 ) and (3.31). We apply 
Lemma 3.14 with the coordinate change x = x + s(x) defined by (3.33). Then, the Sj are given by 
(3.33), and the Sj are given by (3.34). Observe then that on ^42 U yls, all the Sj and £j and are at 
most Oci\p 2 \~^^‘^) or The only non-zero cjj are <75 = 71 Ti and cts = 74 T 4 . Moreover, 

dx^Sj = dp^Sj = 0 for all j G {1, 2,..., 8 }, and similarly dx^^Sj = dp^Sj = 0. Therefore, from 
(3.32) we are left with (^(x) = 'f2j{djh){x)£jix). We now apply this to the function h = Kf. By 
Proposition 3.12, we have Lh = /, so that 

L(/i(x)) = fix) -h '^idjh)ix)£jix) . ^3 35 ^ 

j 

To obtain the desired results, it remains to make the following two observations. First, by the mean 
value theorem, there is for each x some ^(x) G [0,1] such that on yl 2 U A 3 , 

fix) - fix) = Sjix)idjf)ix + ^(x)s(x)) = C>c(|p 2 r^'^^ + , (3.36) 

j 

where we have applied Lemma 3.15 to djf, which is in S by assumption. Secondly, using Lemma 3.15 
and the fact that djh G 5 by Proposition 3.12, we find 

Yidjh)ix)ejix) = Oci\p2\~^^‘^ + \P3\~^^‘^) , 
j 

which, together with (3.35) and (3.36), completes the proof. □ 

We are now ready for the 
Proof of Lemma 3.8. Let 



Uiiqi,.. .,q3,Pi, ■ ■ ■,P3) = KipiWLiq 2 - qi)wciq3 - ^ 2 )) , 
UAiq 2 ,.. ■,q4,P2, ■ ■ ■,P4.) = Ki-PAWviiq3 - q4)wciq3 - ^ 2 )) , 


and 


R23ix) = 


Ulix) , Uiix) 


pI 


+ 


Pi 


That f7i depends only on (gi,..., 53 ,^ 1 ,... ,^ 3 ) follows from the independence of the four rotors 
under the decoupled dynamics. Similarly for C/ 4 . It is easy to check that / = piWi,wc. satisfies the 
assumptions of Proposition 3.12: Since (/)j^ = 0, we also have (d-f)^ = 0 for each multi-index a. 
From this it follows that iteif ) = 0 and that Tt^id-f) = 0, since 7 fe is uniform with respect to qi. Since 
no powers of pi or p^ appear upon differentiation, we indeed obtain that / and all its derivatives are in 


25 



5o. A similar argument applies to / = —p^Wi^wc- Therefore, applying Proposition 3.16, we find that 
on the set A 2 U As, the functions Ui{x) and Ui{x) are Oc{l), and that 


L{Ui{x)) = PiWlWc + Oci\p 2 \ ^^‘^ + \P3\ , 

L{Ui{x)) = -paWkwc + Oci\p2\~^l’^ + • 


In (3.37), the arguments of ITl, ITr and Wq are indeed x and not x. Finally, we have 


Ti?23 


L([/i(x)) L([/4 (x)) 

2 ' 2 

pi pi 


+ (^c{P2 + C’cCPs • 


(3.38) 


The main assertion of the lemma then follows from this, (3.37), and Lemma 3.4. The assertion 
(3.13) follows from the definition of R 23 and the following observation: using the explicit expression 
for X, Proposition 3.12 (i) and Lemma 3.15, we obtain dp^{Ui{x)) = {dp^Ui){x) = Oc(l), and 
dp^{Ui{x)) = {dpJJi){x) = 0 (and similarly for UA- □ 


Remark 3.17. The construction above relies on the strict positivity of the temperatures (which we 
assume throughout). Nonetheless, it can be adapted to the case Ti = T4 = 0. In this case, the 
external rotors are not ergodic under the decoupled dynamics: they deterministically slow down and 
asymptotically reach a given position that depends on the initial condition. Therefore, the conclusion of 
Lemma 3.11 does not hold. However, the counterterm R 23 that we obtained still produces the desired 
effect. Indeed, at zero temperature, the definition of Ui becomes 

poo 

Ui{x) = - Pi{t)WLiq2{t) - qi{t))wc{q3{t) - q2{t)) , 

Jo 

where x{t) is the deterministic solution given in (3.20) and (3.21) with initial condition x and Ti = 
T4 = 0. Since pi (t) decreases exponentially fast and Wi^wq is bounded, this integral still converges. A 
similar argument applies to U 4 . 


4 Constructing a global Lyapunov function 


We construct here the Lyapunov function of Theorem 1.4. We start by fixing fhe paramefers defining 
fhe sefs 172, ^3, and fhe functions V2, T3,14- 

We assume fhroughouf fhis secfion fhaf 9 is fixed and satisfies 


0 < 0 < min 


1 1 


(4.1) 


This condition is necessary fo apply Proposition 2.4 and Proposifion 3.5. In addition, if guarantees 
fhaf when pi + p\ is large, exp(0i7) decreases very fasl: 

Lemma 4.1. There are constants Cj, Cg > 0 such that 


<{Ct-CM+P l)y^ ■ 

Proof. Since 4 (—7b6*(l — OTfjpl + '^hOTfj , fhe resulf follows from fhe condition 

on 6 . □ 


We nexf choose fhe consfanfs k, I, a, m, and finally R. Firsf, we fix k large enough, and require a 
lower bound i?o on R so fhaf fhe conclusions of Proposifion 2.4 hold on Qj {k, R), j = 2,3. We fhen 
fix fhe paramefers a (appearing in V2, T3) and I such fhaf 


2 2 


(4.2) 
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As a consequence, itt-, R) now depends only on m and R, which we fix large enough so that 
Proposition 3.5 applies, and so that m > £ and R > Ro- 

This choice satisfies the condition 1 < k < £ < m imposed in (1.8). This ensures that the sets 
Qj (j = 2, 3) and Qc have “large” intersections, and that they indeed look as shown in Figure 4 and 
Figure 5. Moreover, condition (4.2) ensures that for large \pj\, j = 2, 3, 

ip,f/^«ip,r«ip.'p/", 


which will he crucial. 

We next introduce smooth cutoff functions for the sets £ 1 . 2 , ^ 3 , For this, we consider for each 
set a thin “boundary layer” included in the set itself. 

Definition 4.2. Let V be a subset of the momentum space M^. We define B{V) = {p £ V : 
dist(p, 7^“^) < 1 }. 

Lemma 4.3. Let V C M^. Then, there is a smooth function ^ [0,1] with the following 

properties. First, 'f{p) = 1 onV \ B{V) and 'f{p) = 0 on V^, with some interpolation on B{V). 
Secondly, d-f is bounded on 'Sf for each multi-index a. 

Proof Such a function is obtained by appropriately regularizing the characteristic function of the set 
{p G-P : dist(p,'P'=) > 1/2} C Ml □ 

Since the definition of sets flc and £lj, j = 2,3, involves only the momenta, we can write 
lie = X Vc and Qj = x Vj for some sets Vc, Vj C M^. We apply Lemma 4.3 to Vc, P 2 and V 3 , 
and denote by and the functions obtained. We introduce also the sets 

B{nc)=T^ xB(Vc), Bin 2 )=T^ xB(r 2 ), Bins) = X BiVs) ■ 

Obviously, P(Oc) C Oc and BiQj) C Oj. 

Proof of Theorem 1.4. We show that the Lyapunov function 

V = l + e^^ +Y, fjip)Vj + MMp)Vc 

J=2,3 

has the necessary properties, provided that the constant M is large enough. We start by proving (1.5). 
From (2.15) and (3.8), we immediately obtain the bound 

1 + <V < c('02e'^^'“ + V'sel®! + . (^2 p 2 -^yeH ^ (■ 43 ^ 

which is slightly sharper than (1.5). We next turn to the bound on LV. We introduce the set 

G = {x £ n : Pi p1 < (1 + C' 7 )/C' 8 } , (4.4) 

with C 7 , Cs as in Lemma 4.1, so that 

Le^H < _^eH ^ ^ C7)lGe^^ < (4.5) 

for some Cg > 0, where we have used that H <c + ^ + ^onG. Moreover, observe that for j = 2,3, 
there is a polynomial Zj (p) such that 

LifjVj) = ^jLV, + VjLf, + 2 ^ jbTb{dp,fij){dp,Vj) 

b=l,4 

< + VjLfij + 2 ^ lbTbidp,fij)idp,Vj) (4.6) 

6=1,4 

< i-ln^CspJ^ + lH(Q^.)Zj(p))ell“+H , 
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where the first inequality follows from (2.16) and the second inequality holds because the derivatives of 
V’j(p) have support on B{Qj), because \Tpj — l and because of (2.15). Similarly, using 

Proposition 3.5, we obtain a polynomial Zc{p) such that on Q 

LiMp)Vc) < . (4.7) 

Combining (4.5), (4.6) and (4.7), we find 




J=2,3 


i=2,3 


(4.8) 


The first line contains the “good” terms. We next show that these terms dominate the others. Let 
e > 0. We claim that there is a (large) compact set K (which depends on e) such that 






e /„2 , 


<ee'^^ + cli^ , j = 2,3. 

(4.9) 

< + e J] In.pfe^P^^^+'^P" + cIk , 

(4.10) 

i=2,3 



(4.11) 

i=2,3 



We prove these bounds one by one. 

• Proof of (4.9). We prove the bound for j = 2. First observe that 

^2(p)elP2l“+fP2-®^ < cz2(p)el^’2l“-5(P?+P3+p|) . (4.12) 

By the dehnition of ^ 2 , when ||p|| —^ 00 in 8 (^ 2 ), we hnd pf + + p| ~ |p2p'^^ |P2|“ 

(recalling that 2/k > a). Thus, the right-hand side of (4.12) vanishes in this limit, since Z 2 is 
only a polynomial. This implies (4.9) if K is large enough. 


• Proof of (4.10). By inspection of the definition (1.10) of Qc, there are three regions 
i = 1,2,3, such that if the compact set K is large enough. 


C iT U S(F!e)l u B{n,)2 U S(F!e)3 , 


where B{^c)i is such that P 2 + Pi ~ (Pi + P 4 )™’ where B(flc )2 C ^2 is such that ~ P 2 ’ 
and where B{Qc )3 C ^3 is such that ~ Pg (see Figure 4 and Figure 5). 

Since 2;c(p)e2i^2+P3) is bounded on compact sets, (4.10) trivially holds on K. We next turn to 
B{Q,c)i- We have 

The right-hand side vanishes when ||p|| —)■ 00 in B{Q.c)i, and thus by enlarging K if necessary, 
we find Zc(p)e2i^2+P3) < -I- clx on B{Q.c)i, which implies (4.10). 

Now, consider B{Qc) 2 - We have 


z,{p)el^Pl+Pl) 

p-2giP2i“+fpi 


Zc{p)p2e'^P^ 


As IIpII —)■ 00 in .6 (flc) 2 , the right-hand side vanishes, since pg ~ |p 2 p/^anda > 2/i. Therefore, 
2 ^c(p)e2if2-i-P3) < ep“^elf 2 l“+ 2 P 2 q- on B{Qc )2 for large enough K, and thus (4.10) holds 

on.B(nc)2 since .B(nc)2 C ^ 2 - 

A similar argument applies for B{Q,c) 3 , which completes the proof of (4.10). 
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• Proof of (4.11). Observe that for K large enough, the set G defined in (4.4) verifies 

G C X U O2 U Oc U O3 . 


On K and Oc, (4.11) holds frivially. On G n O 2 \ Oc and for large enough ||p||, we have 
P 3 < (ofherwise we would have x G Oc), and Iherefore 


ef(pi+pi) 

p-2gb2i“+fpi 


= pieiP3-b2l° 




Sinee a > 2j£, and by enlarging K if necessary, we have < £p~‘^e^P'^^‘^~^ 2 P 2 -y cl^ on 

G n O 2 \ Oc, so fhaf (4.11) holds on fhis sef. Since a similar argumenf applies in G n O 3 \ Oc, 
fhe proof of (4.11) is complete. 


Subslifufing (4.9), (4.10), and (4.11) info (4.8), we find 

LV < -(1 - 2e - Me)e®^ - ^ lo.(G 3 - Gge - Me)p-2elP^I“+tpI 

i=2,3 

- ln,(MG6 - G9)e^(?’2+Pi) + cIk • 


Since fhe consfanfs G* do nof depend on e and M, we can make fhe fhree parenfheses (1 — 2e — Me), 
(G 3 — Gge — Me) and {MCq — Gg) positive by choosing M large enough and fhen e small enough. 
Using again (2.15) and (3.8), we finally obfain 

LV < -ce^^ - c - clttc- 2 ^^ + cIk ■ (4.13) 

i=2,3 Pj ^2 + P 3 

We now show fhaf fhis implies (1.6). Observe fhaf since V > , we have log V > 6 H, and Iherefore, 

forj = 2,3. 


< P 2 + pI ^ cM + c < c log U + c < c(log U + 2 ) 


Since also I{2 + log V), we obfain by (4.13) fhaf 


’I 

^en 


+ lu,V2 + lnsV3 + ln.Vc 


LV < -c 


2 + log V 

which, by fhe definition (1.7) of p, proves (1.6). 


+ cIk = - 


cV 


2 + log(U) 


+ cIk , 


□ 


5 Proof of Theorem 1.3 

Now fhaf we have a Lyapunov function (Theorem 1.4), we can prove Theorem 1.3 in fhe spirif of [6]. 
In addifion fo Theorem 1.4, we need a few ofher ingredienfs. 

We firsl use fhe resulf of [8] abouf subgeomefric ergodicify. We sfafe if here in a simplified form. 
For a definifion of “irreducible skelefon” and “pefife sef”, see fhe infroducfion of [8] or [6, Section 2]. 

Theorem 5.1 (Douc-Forf-Guillin (2009)). Assume that a skeleton of the process (L2) is irreducible 
and letV:Ll^ [1, oo) be a smooth function with lim||p||^oc V (q,p) = +oo. If there are a petite set 
K and a constant C such that LV < Gli^- — p{y)for some differentiable, concave and increasing 
function p : [ 1 , oo) —)■ ( 0 , oo), then the process admits a unique invariant measure vr, and for any 
z G [0,1], there exists a constant C' such that for allt>0 and all x G Ll, 

\\P\x, •) - 7 r||(^oV). < g{t)C'V{x) , (5.1) 

where g{t) = (p o with H^{u) = 
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Proof. This is a combination of [8, Theorems 3.2 and 3.4] for the following “inverse Young’s functions” 
(in the language of [8]): 'Ti(s) oc and '^ 2 ( 5 ) oc . □ 


Theorem 1.4 provides most of the input to Theorem 5.1, but we still need to check that there is an 
irreducible skeleton, and that the set K of Theorem 1.4 is petite. To this end, we introduce, as in [ 6 ], 

Proposition 5,2. The following holds. 

(i) The transition probabilities P^{x,dy) have a C°°((0, 00 ) x 12 x density pt{x,y) and the 
process is strong Feller. 

(ii) The time-1 skeleton chain (xn)n=o,i, 2 , -' admits the Lebesgue measure on (12, B) as a maximal 
irreducibility measure. 

(Hi) All compact subsets of Ft. are petite. 

Proof, (i) follows from Hormander’s condition. The proof that Hdrmander’s condition holds, which 
relies on Assumption 1.2, is very similar to that of Lemma 5.3 of [ 6 ] and is left to the reader. The 
proof of (ii) is exactly as in Lemma 5.6 of [ 6 ], and (iii) follows from (i), (ii), and Proposition 6.2.8 
of [ 22 ]. □ 

We can now finally give the 

Proof of Theorem 1.3. Let 0 < < min(l/Ti, I/T4) and 62 > 61 . Choose now 6 G ( 6 * 1 , 02 ) 

such that 9 < min(l/Ti, I/T4). By Theorem 1.4, we have a Lyapunov function 1 + < V < 

c(elP 2 |“ _|_ ^ 2 ) such that LV < cIk — </ 5 (L), where (p{s) = C 3 s/{2 + log(s)), 

and where AT is a compact (and therefore petite) set. Let now 2 ; G (0,1) be such that z 6 > 61 . By 
Theorem 5.1, we obtain the existence of a unique invariant measure tt such that 

\\P\x, ■) - 7r||(,^oy)- < ce“^*^^V(x) , (5.2) 

where we have used that with the notation of Theorem 5.1, 

g{t) = {(fo (5.3) 

for some A > 0. Indeed, H^{u) = ^ /“ = ^(logu)'^ + ^logu, so that = 

exp(( 2 c 3 f + 4)^/^ — 2 ) and {(p o + 4)“*^/^ exp(( 2 c 3 f + 4)^/^ — 2 ) > , which 

implies (5.3). 

Then, (1.4) follows from (5.2) and the following two observations. First, we have V < since 
9 < 02 - Secondly, by our choice of z, we have < c(ip o V)^, so that \\P^{x, ■) — Tr\\f,eiH < 
c||P*(x, •) -7r||(^oy)-- 

Thus, we have proved (iii). Since (i) and the smoothness assertion in (ii) follow from Proposition 5.2, 
the proof is complete. □ 

Remark 5.3. It would of course be desirable to generalize Theorem 1.3 to longer chains of rotors. The 
proof of Proposition 5.2 carries on unchanged to chains of arbitrary length. Therefore, in order to prove 
the existence of a steady state and obtain a convergence rate (with Theorem 5.1), it “suffices” to find an 
appropriafe Lyapunov funcfion. We expecf fhe convergence rafe fo be limifed by fhe cenfral rotor (if fhe 
lengfh of fhe chain is odd) or fhe fwo cenfral rofors (if fhe lengfh is even). Preliminary sfudies indicafe 
fhaf for chains of lengfh n, a convergence rafe exp(—cf*') wifh k = l/{2\n/2] — 2 ) is fo be expected. 
Obfaining such a resulf raises some major technical difficullies. Firsf, fhe averaging procedure has fo 
be carried to much higher orders, which quickly becomes intractable if we proceed explicitly, as we 
do here. Moreover, the number of regimes to consider grows very rapidly with n. And finally, some 
generalizafion of Proposition 3.12 to more general (nonlinear) decoupled systems will be needed, wifh 
fhe difficullies mentioned in Remark 3.13. We are frying to solve fhese issues by developing a inductive 
mefhod which requires fewer explicil calculations, bul much work remains fo be done. 
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A Resonances in the deterministic case 

In §2, two resonant terms appeared, namely piWi,wc/p^ and —paW-^wc/p\- These terms have a 
physical meaning. We start with the case where Wi(s) = — cos(s), I = L, C, R. Then, 

jrr ( N ■ / N siniqi-qs) sm{ 2 q 2 - qs - qi) .... 

W-LWc = - cos(g 2 - qi) sm(g 3 - q 2 ) = -^-^-^- . (A.l) 

Consider now the regime where most of the energy is concentrated at sites 2 and 3. In the approximate 
dynamics (3.1), we see that sin(gi — ga) oscillates with frequency P3/27r and mean zero, while 
sin(2g2 — q'i — qi) oscillates with frequency {2p2 — ps) /27r. When p^ = 2 p2, the second term does 
not oscillate. 



P2 

Figure 8 - Projection of a few orbits on the p 2 P 3 -plane, with Wl = Wc = Wr = — cos, 

7 i = 74 = 1, Ti = Ti = 0. The resonances are depicted as dashed lines. 

In Figure 8 , we represent some trajectories projected onto the p 2 F 3 -plane in the deterministic case 
{i.e., Ti = Ta = 0). We observe that some trajectories are “trapped” hy the line pzlp 2 = 2, while 
some others just cross it. By symmetry, the same happens when pzlP 2 = 1/2 because of the term 
—PaWj{Wc/p\- This phenomenon does not occur when the same conditions are used with positive 
temperatures (see Figure 3). A finer analysis (not detailed here) shows that in the resonant regime 
PzIp 2 = 2 , a net momentum flux from p 3 to p 2 appears, and similarly for pzlP 2 = 1/2 with a flux 
from p2 to p 3 . These fluxes stabilize the resonant regimes. 

If we take ITi(s) = — cos(nis) for some ni G Z*, I = L, C, R, we find by a decomposition similar 
to (A.l) some resonances at 

^ ^ f nc + rtL nc - rtL rap _ np ) 

P2 \ nc ’ np ’ nc + hr ’ np - UR / ■ 

(If some of these values are 0 or oo, we exclude them since our approximation is reasonable when both 
IP 2 I and |p 3 | are very large.) For example, if we choose (np, np, nR) = (3,1, 3), we obtain the ratios 
P3/P2 = 4,1/4, —2, —1/2, which we indeed observe in Figure 9. 

Of course, a similar analysis applies to more general interaction potentials by taking their Fourier 
series and treating the (products of) modes separately. 
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Figure 9 - Projection of a few orbits on the p 2 P 3 -plane, with 71 = 74 = 1, Ti = T4, = 0, 
and (riL, nc, tir) = (3,1, 3). The resonances are depicted as dashed lines. 
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